Text Gestalt: Stroke-Aware Scene Text Image Super-resolution

نویسندگان

چکیده

In the last decade, blossom of deep learning has witnessed rapid development scene text recognition. However, recognition low-resolution images remains a challenge. Even though some super-resolution methods have been proposed to tackle this problem, they usually treat as general while ignoring fact that visual quality strokes (the atomic unit text) plays an essential role for According Gestalt Psychology, humans are capable composing parts details into most similar objects guided by prior knowledge. Likewise, when observe image, will inherently use partial stroke-level recover appearance holistic characters. Inspired we put forward Stroke-Aware Scene Text Image Super-Resolution method containing Stroke-Focused Module (SFM) concentrate on internal structures characters in images. Specifically, attempt design rules decomposing English and digits at stroke-level, then pre-train recognizer provide attention maps positional clues with purpose controlling consistency between generated image high-resolution ground truth. The extensive experimental results validate can indeed generate more distinguishable TextZoom manually constructed Chinese character dataset Degraded-IC13. Furthermore, since SFM is only used guidance training, it not bring any time overhead during test phase. Code available https://github.com/FudanVI/FudanOCR/tree/main/text-gestalt.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Super-Resolution Enhancement of Text Image Sequences

Abstract The objective of this work is the super-resolution enhancement of image sequences. We consider in particular images of scenes for which the point-to-point image transformation is a plane projective transformation. We first describe the imaging model, and a maximum likelihood (ML) estimator of the super-resolution image. We demonstrate the extreme noise sensitivity of the unconstrained ...

متن کامل

Corpus based coreference resolution for Farsi text

"Coreference resolution" or "finding all expressions that refer to the same entity" in a text, is one of the important requirements in natural language processing. Two words are coreference when both refer to a single entity in the text or the real world. So the main task of coreference resolution systems is to identify terms that refer to a unique entity. A coreference resolution tool could be...

متن کامل

Text Super-Resolution: A Bayesian Approach

We address the problem of text super-resolution: given an image of text scanned in at low resolution from a piece of paper, return the image that is mortly likely to be generated from a noiseless high-resolution scan of the same piece of paper. In doing so, we wish to: (1) avoid introducing artifacts in the high-resolution image such as blurry edges and rounded corners, (2) recover from quantiz...

متن کامل

An Introduction to Super-Resolution Text

This chapter examines the field of super-resolution with application to text analysis. While the area of super-resolution has been dealt with in fair depth in recent years, it is only just becoming useful as an applicable stage in improving text images, particularly for further processing, transmission, and understanding on mobile and handheld devices. After dealing with the general concepts of...

متن کامل

Super-resolution Text using the Teager Filter

We propose a super-resolution technique specifically aimed at enhancing low-resolution text images from handheld devices. The Teager filter, a quadratic unsharp masking filter, is used to highlight high frequencies which are then combined with the warped and interpolated image sequence following motion estimation using Taylor series decomposition. Comparative performance evaluation is presented...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

ژورنال

عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence

سال: 2022

ISSN: ['2159-5399', '2374-3468']

DOI: https://doi.org/10.1609/aaai.v36i1.19904